Ligand-Based Virtual Screening by Novelty Detection with Self-Organizing Maps
Abstract
We describe a novel method for ligand-based virtual screening that uses Self-Organizing Maps (SOM) as a novelty detection device. Novelty detection (or one-class classification) is the task of identifying patterns that do not belong to the space covered by a given data set. In ligand-based virtual screening, chemical structures perceived as novel lie outside the known activity space and can therefore be discarded from further investigation. In this context, a "novel structure" is a compound that is unlikely to share the activity of the query structures, whereas compounds not perceived as "novel" are suspected to share that activity. Various databases now contain active structures, but access to compounds that have been found inactive in a biological assay is limited. This work addresses the problem via novelty detection, which does not require proven inactive compounds. The structures are described by spatial autocorrelation functions weighted by atomic physicochemical properties. Different methods for selecting a subset of targets from a larger set are discussed. A comparison with similarity search based on Daylight fingerprints followed by data fusion is presented; the two methods complement each other to a large extent. In a retrospective screening of the WOMBAT database, novelty detection with SOM gave enrichment factors between 105 and 462 when the 100 top-ranked structures were considered, an improvement of between 25% and 100% over the similarity search based on Daylight fingerprints. Novelty detection with SOM is applicable (1) to improve the retrieval of potentially active compounds, also in concert with other virtual screening methods; (2) as a library design tool for discarding a large number of compounds that are unlikely to possess a given biological activity; and (3) for selecting a small number of potentially active compounds from a large data set.
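The general idea of one-class screening with a SOM can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the map size, the training schedule, or the exact novelty criterion. The sketch assumes that each compound is already encoded as a fixed-length descriptor vector, that the SOM is trained only on known actives, and that novelty is scored by the quantization error (distance to the best-matching unit). The function names `train_som`, `novelty_score`, and `enrichment_factor` are hypothetical, and the enrichment factor uses the common definition (hit rate in the top-ranked subset divided by the hit rate in the whole database), which may differ in detail from the paper's.

```python
import numpy as np


def train_som(data, rows=4, cols=4, epochs=20, lr0=0.5, sigma0=2.0, seed=0):
    """Fit a small rectangular SOM to `data` (n_samples x n_features).

    Map size, learning rate, and neighbourhood schedule are illustrative
    assumptions, not values taken from the paper.
    """
    rng = np.random.default_rng(seed)
    # Initialise each unit's weight vector from a randomly chosen training sample.
    weights = data[rng.integers(0, len(data), rows * cols)].astype(float)
    # Grid coordinates of every unit, used by the neighbourhood function.
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    n_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for x in data[rng.permutation(len(data))]:
            frac = step / n_steps
            lr = lr0 * (1.0 - frac)                  # linearly decaying learning rate
            sigma = max(sigma0 * (1.0 - frac), 0.5)  # shrinking neighbourhood radius
            # Best-matching unit: the unit whose weights are closest to x.
            bmu = np.argmin(np.linalg.norm(weights - x, axis=1))
            # Gaussian neighbourhood around the BMU on the 2-D grid.
            grid_dist2 = np.sum((grid - grid[bmu]) ** 2, axis=1)
            h = np.exp(-grid_dist2 / (2.0 * sigma ** 2))
            weights += lr * h[:, None] * (x - weights)
            step += 1
    return weights


def novelty_score(weights, x):
    """Quantization error: distance from each row of `x` to its nearest SOM unit.

    A large score means the compound lies far from the region covered by the
    training actives, i.e. it is "novel" in the sense used above.
    """
    x = np.atleast_2d(x)
    d = np.linalg.norm(x[:, None, :] - weights[None, :, :], axis=2)
    return d.min(axis=1)


def enrichment_factor(scores, is_active, top_n):
    """Hit rate among the `top_n` least-novel compounds over the overall hit rate."""
    top = np.argsort(scores)[:top_n]  # lowest novelty first
    return is_active[top].mean() / is_active.mean()
```

In use, one would train on the descriptor vectors of the query actives, score every database compound with `novelty_score`, and either discard compounds above a chosen threshold or keep only the lowest-scoring ones. No inactive compounds are needed at any point, which is the property the abstract emphasizes.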
Similar resources
The Self-Organizing Map as a Tool in Knowledge Engineering
The Self-Organizing Map (SOM) is one of the most popular neural network methods. It is a powerful tool in visualization and analysis of high-dimensional data in various engineering applications. The SOM maps the data on a two-dimensional grid which may be used as a base for various kinds of visual approaches for clustering, correlation and novelty detection. In this chapter, we present novel me...
Novelty detection using Self-Organizing Maps
Failure detection in process monitoring involves a classification mainly on the basis of data from normal operation. When a Self-Organizing Map is used for the description of normal system behaviour, a compatibility measure is needed for declaring a map and a dataset as matching. We propose a novel variant of one such measure and investigate the usefulness of existing and novel measures both with...
Novelty-dependent learning and topological mapping
Unsupervised topological ordering, similar to Kohonen’s (1982) Self-organizing feature map, was achieved in a connectionist module for competitive learning (a CALM Map) by internally regulating the learning rate and the size of the active neighborhood on the basis of input novelty. In this module winner-take-all competition and the 'activity bubble' are due to graded lateral inhibition between ...
Using Context to Get Novel Recommendation in Internet Message Streams
Novelty detection algorithms usually employ similarity measures with previously seen and relevant documents to decide whether a document is of interest to the user. The problem with this approach is that the system might recommend redundant documents. Thus, it has become extremely important to be able to distinguish between “redundant” and “novel” information. To address this limitation, ...
Green Product Consumers Segmentation Using Self-Organizing Maps in Iran
This study aims to segment the market based on demographical, psychological, and behavioral variables, and seeks to investigate their relationship with green consumer behavior. In this research, self-organizing maps are used to segment and to determine the features of green consumer behavior. This was a survey type of research study in which eight variables were selected from the demographical,...
Journal title:
- Journal of Chemical Information and Modeling
Volume 47, Issue 6
Publication date: 2007